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Cut-Set Bounds for Multimessage Multicast Networks 
with Independent Channels and Zero-Delay Edges 
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Abstract 

We consider a communication network consisting of nodes and directed edges that connect the nodes. 
The network may contain cycles. The communications are slotted where the duration of each time slot 
is equal to the maximum propagation delay experienced by the edges. The edges with negligible delays 
are allowed to be operated before the other edges in each time slot. For any pair of adjacent edges [I, i) 
and (i, j) where {I, i) terminates at node i and (i,j) originates from node i, we say (£, i) incurs zero 
delay on (i,j) if (f, i) is operated before otherwise, we say (£,i) incurs a unit delay on ( i,j ). 

In the classical model, every edge incurs a unit delay on every adjacent edge and the cut-set bound is a 
well-known outer bound on the capacity region. In this paper, we investigate the multimessage multicast 
network (MMN) consisting of independent channels where each channel is associated with a set of edges 
and each edge may incur zero delay on some other edges. Our result reveals that the capacity region of 
the MMN with independent channels and zero-delay edges lies within the classical cut-set bound despite 
a violation of the unit-delay assumption. 


Index Terms 

Multimessage multicast networks, zero-delay edges, independent channels, cut-set bounds 

I. Introduction 

This paper studies time-slotted communications over networks consisting of nodes and directed edges 
that connect the nodes, and the networks may contain cycles. Each edge receives a symbol from a node and 
outputs a symbol to a node in each time slot, where the duration of a time slot is set to be the maximum 
propagation delays experienced by the edges. In practical communication networks, propagation delays of 
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different links may vary significantly due to different distances and different transmission medium (e.g., 
optical fiber, air, water, etc.) across different links. For example, links with relatively short distances 
have shorter propagation delays compared to those with relatively long distances, and links established 
through the optical fiber medium generally experience negligible propagation delays compared to links 
established through the water medium. In order to characterize the scenario where the propagation delays 
experienced by some edges are negligible compared to the delays experienced by the other edges, we 
allow the edges with negligible delays to be operated before the rest of the edges in each time slot. Since 
the symbols transmitted on earlier-operated edges may depend on the symbols output from latter-operated 
edges, we say that an edge (£, i) terminating at node i incurs zero delay on an edge (i,j) originating from 
node i if (£,i) is operated before (i, j); otherwise, we say (£,i) incurs a unit delay on Similarly, 

the network is said to contain zero-delay edges if there exists an edge that incurs zero delay on another 
edge; the network is said to contain no zero-delay edge if every edge incurs a delay on every other edge. 
Under the classical model, every discrete memoryless network (DMN) |JTJ Ch. 15] is assumed to contain 
no zero-delay edge because all the edges are operated at the same time. A well-known outer bound on 
the capacity region of the DMN that contains no zero-delay edge is the classical cut-set bound |2j. It 
is easy to construct a network with zero-delay edges whose capacity region is strictly larger than the 
cut-set bound. One such network is the binary symmetric channel with correlated feedback (BSC-CF) 
considered in |3j Sec. VII], which will be introduced in the next subsection. 

A. Two Motivating Examples 

1) A two-way channel: Consider a network that consists of two nodes denoted by 1 and 2 respectively 
and two edges denoted by (1, 2) and (2,1) respectively. Node 1 and node 2 want to transmit a message 
to each other. This is a two-way channel |J4]. In each time slot, node 1 and node 2 transmit X^ 2 ) and 
X( 2 . 1 ) respectively, and they receive YA,t) and V) respectively. All the input and output alphabets 
are binary, and the channel associated with edge (1,2) is a binary symmetric channel (BSC) while the 
channel associated with edge (2,1) is a discrete memory less channel (DMC) whose output may depend 
on the output of channel (1,2). In this network, channel (1,2) incurs zero delay on channel (2,1), i.e., 
node 2 can receive Y( 12 ) before encoding and transmitting X i:1 . \ ;■ We call this network the BSC with 
DMC feedback (BSC-DMCF), which is illustrated in Figure [lja). 

When Y( 2 .i) = -^( 2 , 1 ) + Y’( 1j2 ), the BSC-DMCF is also referred to as the BSC-CF in ||3j Sec. VII]. It has 
been shown in |3] Sec. VII] that the capacity region of the BSC-CF is strictly larger than the classical cut¬ 
set bound, where the classical cut-set bound is obtained under the assumption that the network contains 
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(a) BSC-DMCF / BSC-CF (b) BSC-IF 


Fig. 1. Examples of the two-way channel. 


no zero-delay edge while the capacity region of the BSC-CF is achieved when edge (1,2) incurs zero 
delay on edge (2,1). Consequently, we have the following conclusion: 

(*) The capacity region of some BSC-DMCF with zero-delay edges is strictly larger than the 
classical cut-set bound. 

However, Statement (*) is based on an important assumption: The outputs of the two channels can have 
correlation given their inputs. In other words, the noises of the two channels can be correlated given the 
channel inputs. If the noises are assumed to be independent given the channel inputs, i.e., 

Pr{T)i 2) = a, T(2,i) = M^(i,2) = c >^(2,t) = d} 

= PrjY)^) = a|X( 12 ) = c}Pr{Y( 2 ,i) = M^( 2 ,i) = d} ( 1 ) 

for all (a, 6, c, d) G {0,1} , then it is not clear whether Statement (*) still holds. To facilitate discussion, 
we call the BSC-DMCF which satisfies (jT[) the BSC with independent feedback (BSC-IF), which is 
illustrated in Figure [ljb). Indeed, the BSC-IF with zero-delay edges always lies within the classical cut¬ 
set bound due to the fact that the two channels are independent and the well-known fact that the presence 
of instantaneous feedback does not increase the capacity of a point-to-point channel [TJ Sec. 7.12], 
Consequently, Statement (*) does not hold for the BSC-IF. This motivates us to investigate a general 
network with zero-delay edges under the assumption that the channels are independent, and compare its 
capacity region with the cut-set bound. 

2) A two-relay network (TRN): Consider a two-relay network (TRN) illustrated in Figure [2j which 
consists of a source denoted by 1, two relays denoted by 2 and 3 respectively, and a destination denoted 
by 4. The source wants to send information to the destination with the help of the two relays. Suppose 
all the input and output alphabets are binary. In each time slot, Xuj^ is transmitted on edge (i, j) by 
node i and Y(i. : j) is received from edge (i.j) by node j for each edge (i. j) in the relay network. In 
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Fig. 2. TRN-CN / TRN-IN 


addition, suppose 

1(1,2) = U, (2) 

1(2,3) = V, (3) 

and 

1"(1,4) = l / (2,4) = 1(3,4) = * (1 ,4) + -^(2,4) + ^(3,4) + U + V (4) 

where U and V are two independent Bernoulli random variables with 

Pr{?7 = 0} = Pt{V = 0} = 1/2. (5) 

To facilitate discussion, we call the TRN described above the two-relay network with correlated noises 
(TRN-CN). It can be easily seen that if edge (1,2) incurs zero delay on edge (2,4) and edge (2,3) 
incurs zero delay on edge (3,4), then node 4 can receive one bit per time slot from node 1 with the 
help of nodes 2 and 3 sending U and V respectively (cf. Q). On the contrary, if either edge (1,2) 
incurs a delay on edge (2,4) or edge (2, 3) incurs a delay on edge (3,4), then node 4 cannot receive any 
information from node 1 because the independent uniform bits U and V cannot be completely cancelled 
simultaneousljQ Consequently, the capacity of the TRN-CN with zero-delay edges is strictly larger than 
the classical cut-set bound, i.e., 0. 

Consider another TRN with the same topology as illustrated in Figure [2] and specified by 

— %(i,j ) ( 6 ) 

*As usual, the noises U and V generated in different time slots are assumed to be independent. 
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for all (i.j) / (1,4) and 

^(1,4) = -^(1,4) + -^(2,4) + -^(3,4) + -^(1,4) (7) 

where are independent Bernoulli random variables. To facilitate discussion, we call the TRN 

described above the two-relay network with independent noises (TRN-IN). It can be easily seen that the 
capacity of the TRN-IN, which is equal to the capacity of channel (1,4) specified in (|T]|, coincides with 
the cut-set bound even when zero-delay edges are present. 

B. Multimessage Multicast Network 

In this paper, we consider the multimessage multicast network (MMN) |5, Ch. 18] consisting of 
independent channels, where the destination nodes want to decode the same set of messages transmitted 
by the source nodes. Two simple examples of the MMN with independent channel are the following two 
networks introduced in Section |FA| — the BSC-IF (where both nodes want to decode all the messages) 
and the TRN-IN, which belong to the class of MMNs consisting of independent discrete memoryless 
channels (DMCs) |[6j. Note that the BSC-CF, unlike the BSC-IF, does not belong to the class of MMNs 
with independent DMCs because the forward and reverse channels of the BSC-CF are correlated (cf. 
Figure [lja)). Similarly, the TRN-CN, unlike the TRN-IN, does not belong to the class of MMNs with 
independent DMCs because the noises among the channels of the TRN-CN are correlated (cf. Figure [2] 
@ and @). 

We propose an edge-delay model for the discrete memoryless MMN (DM-MMN) with independent 
channels and zero-delay edges. In our model, each channel is associated with a set of directed edges 
(e.g., a channel characterized by q Y(1 i) y ( , 2 4) \x {14l ,x (2 4) is associated with {(1,4), (2,4)}) and the channels 
are operated in a predetermined order so that the output random variables generated by earlier-operated 
channels are available for encoding the input random variables for latter-operated channels. Therefore, 
an edge may incur zero delay on another edge under our model. Our edge-delay model can be used to 
investigate the practical situation when some edges with negligible propagation delays are allowed to be 
operated before the rest of the edges in each time slot (for instance, in a cellular network, edges that 
connect the mobile users to their closest relays may experience negligible propagation delays compared 
to those that connect the relays to the base stations). The channels of the MMN are assumed to be 
independent, meaning that the outputs among the channels are independent given their inputs, but the 
outputs within a channel are allowed to correlate with each other. 
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C. Main Contribution and Related Work 

The main contribution of this paper is twofold: First, we establish an edge-delay model for the MMN 
consisting of independent channels which may contain zero-delay edges. Our model subsumes the classical 
model which assumes that every edge incurs a unit delay on every adjacent edge. Second, we prove that 
for each DM-MMN consisting of independent channels with zero-delay edges, the capacity region always 
lies within the classical cut-set bound despite a violation of the classical unit-delay assumption. Combining 
our cut-set bound result with existing achievability results from network equivalence theory j6] and noisy 
network coding (NNC) (7J, (8j, we establish the tightness of our cut-set bound under our edge-delay 
model for the MMN with independent DMCs, and hence fully characterize the capacity region. More 
specifically, we show that the capacity region is the same as the set of achievable rate tuples under the 
classical unit-delay assumption. The capacity region result is then generalized to the MMN consisting of 
independent additive white Gaussian noise (AWGN) channels with zero-delay edges. 

The MMN with independent DMCs has been investigated in [9j under the node-delay model proposed 
in |j3] for general discrete memory less networks. The precise definition of zero-delay nodes in [j3], Sec. IV] 
can be expressed under our edge-delay model as follows: A node incurs no delay if and only if every 
incoming edge of the node incurs no delay on every outgoing edge of the node. In 0, it was shown that 
the capacity region of the MMN with independent DMCs is equal to that under the classical unit-node- 
delay assumption even when zero-delay nodes are present. Under the node-delay model, all the outgoing 
edges of each zero-delay node must be operated simultaneously after all the incoming edges of the node 
have been operated (cf. (9i Definitions 1 and 2]) while under our edge-delay model, the incoming and 
outgoing edges of a node can be operated in some predetermined order (cf. Definition 00 and [4]). For 


example, for the TRN-CN described in Section I-A2 edge (2, 3) can be operated before edge (2,4) under 
our edge-delay model but not under the node-delay model. In this work, we strengthen the main result 
in |9j under our edge-delay model for the MMN with independent DMCs and show that the capacity 
region remains unchanged even when zero-delay edges are present. 

It was shown by Effros [10] that under the positive-delay assumptior^in the classical setting, the set of 
achievable rate tuples for the MMN with independent channels does not depend on the amount of positive 
delay incurred by each edge on each other edge. Our capacity result for the MMN with independent 
DMCs (as well as AWGNs) complements Effros’s finding as follows: The set of achievable rate tuples 


2 Effros’s framework does not consider zero-delay edges, which can be seen from the encoding rules stated in 110 
that assumes X t .k is a function of (W4,f' fc_1 ). 


Def. 1] 


April 27, 2016 


DRAFT 




7 


for the MMN with independent DMCs (as well as AWGNs) does not depend on the amount of delay 
incurred by each edge on each other edge, even with the presence of zero-delay edges. From a practical 
point of view, the capacity region of the MMN with independent DMCs (as well as AWGNs) is not 
affected by the way of handling delays among the channels or how the channels are synch ionized, even 
when zero-delay edges are present. 


D. Paper Outline 

The rest of the paper is organized as follows. Section [[[[presents the notation. Section [TIT] formulates our 
edge-delay model for the DM-MMN with independent channels and zero-delay edges and state our cut¬ 
set bound result, whose proof is contained in Section [TV] In Section |V| we use our cut-set bound to prove 
the capacity region of the MMN with independent DMCs and zero-delay edges. Section [VI] generalizes 
our cut-set bound result to the MMN consisting of independent AWGN channels with zero-delay edges 


and characterizes its capacity region. Section VII concludes this paper. 


II. Notation 

The sets of natural, real and non-negative real numbers are denoted by N, M and R + respectively. We 
use (a)+ to denote max{a,0}. We will take all logarithms to base 2. We use Pr{A} to represent the 
probability of an event E. We use an upper case letter X to denote a random variable with alphabet 
X, and use a lower case letter x to denote the realization of X. We use X" to denote a random tuple 
(X\, X 2 ,..., X n ), where the components Xj.. have the same alphabet X. 

For any arbitrary random variables X and Y, we let px,Y and p Y \x denote the probability distribution 
of ( X , Y) (can be both discrete, both continuous or one discrete and one continuous) and the conditional 
probability distribution of Y given X respectively. We let px. Y (x, y ) and p Y \x(y\ x ) be the evaluations of 
px,Y and p Y \x respectively at ( X , Y) = (x. y ). To avoid confusion, we do not write Pr{X = x. Y = y } 
to represent px, Y (x, y) unless X and Y arc both discrete. We let pxPv\x denote the joint distribution of 
(X,Y), i.e., PxPy\x( x , y) = Px(x)p Y \x(y\ x ) for all x and y. If X and Y are independent, their joint 
distribution is simply pxPY■ We let A f{ -: /./, a 2 ) : M —y [0, oo) denote the probability density function 
of a Gaussian random variable whose mean and variance are p and a 2 respectively, i.e., Af(z; p, a 2 ) = 


For any random tuple ( X , Y, Z) distributed according to px,Y,z, we let H Px z (X\Z) and I Px YZ (X; Y\Z) 
be the entropy of X given Z and mutual information between X and Y given Z respectively. If 
X is continuous, we let h px z (X\Z) be the differential entropy of X given Z. If X, Y and Z are 
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distributed according to px,Y,z and they form a Markov chain, we write (X —> Y —> Z) p . For 
simplicity, we drop the subscript of a notation if there is no ambiguity. The -distance between two 
distributions px and qx on the same discrete alphabet X, denoted by \\px — qx\\cn i s defined as 
II Px ~ qx\\ci = T,x&x \Px(x) - q x (x)\. 

III. Multicast Networks Consisting of Independent Channels 

A multimessage multicast network (MMN) consists of N nodes and N 2 directed edges. The MMN 
may contain cycles. Let X = {1,2,..., N} be the index set of the nodes, and let £ =1x1 be the 
index set of the edges. Let VC X and V C X be the sets of sources and destinations respectively, where 
each source in V transmits one message and each destination in V wants to decode all the messages 
transmitted by the sources in V. We call (V, V) the multicast demand for the network. The sources in V 
transmit information to the destinations in V in n time slots (channel uses) as follows. Node i transmits 
message II) <G {1.2,..., M, } for each i <E V and node j wants to decode {IT) : i £ V} for each 
j £ V, where Mi denotes the size of II). We assume that each message IT) is uniformly distributed over 
{1, 2,..., M{\ and all the messages are independent. For each k £ {1, 2,..., n} and each (i,j) £ £, 
node i transmits {AX^\ and receives {Y(e.f),k}tei i n the k th time slot, where X^^\ k is the symbol 
transmitted on edge (?, £) by node i and Y{c :)P k is the symbol received from edge {£. j) by node j. The 
alphabet sets of Xu^ k and Y( l])k are denoted by Xuj) and Y^j) respectively for each (?) j) £ £, 
where and Y(ij) do not depend on the time index k. After n time slots, node j declares II)j to 

be the transmitted II) based on (IT), {Y]^}^!) for each (?) j) £ VxD. 

To simplify notation, we use the following conventions for each T \, T -2 C X: We let II'V, — (H) : i £ 
Xi) be the subtuple of (H), IT),..., Wn), and let ITt iX t 2 — (IT): (?, j) £ T\ x T 2 ) be the subtuple 
of (IT), 1 , IT), 2 , • • •, Wn,n)- For any ^-dimensional random tuple AX 2 ),..., Xr NjN ], we let 

Xt iX t 2 — (X(jj) : (i,j) £ Ti x X 2 ) 

be its subtuple. Similarly, for any k £ {1,2,..., n} and any random tuple (X ( - U jj,, X( L2 ).k- ■ • • ■ Xr N , jv),fe), 
we let 

Xt iX T 2 ,/c — ( X (i,j),k '■ ()j) G Ti X T 2 ) (S) 

be its subtuple. 

Definition 1: An a-dimensional tuple (fli, ■ ■ ■, consisting of non-empty subsets of £ is called 
an a-partition of £ if U^ =1 f \ = £ and I) n Oj = 0 for all i j. 
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Definition 2: The discrete network with independent channels consists of N 2 finite input sets Pc’; u ), 
ffn 2 ), ■ • •, X(n,n)’ N 2 finite output sets 34i,i), 3 '’(i, 2 ), • • ■, y(N,N) an d a channels characterized by con¬ 
ditional distributions qy^ ^ ,..., , where 12 = (Oi, Q 2 . ■ ■ . £l a ) is an a-partition of 

£. We call 12 the edge partition of the network. The discrete network is denoted by (Xs, 34:, a, 12, q) 
where q = (q^, ■ ■ ■, q^). 

Under our model, the discrete network is characterized by a channels denoted by all ^ , c/! 2) , „ ,..., 

qy j | Xs; as defined in Definition If a = 1, our model simplifies to the classical model which is 
characterized by only one channel denoted by qy^ X£ - The following example illustrates our model when 

a = 3. 


Example 1: For the BSC-IF described in Section I-A| we have X = {1, 2} and £ = {(1,1), (1, 2), (2,1), 
(2,2)}, where Tf( 12 ) = 3 ^(i, 2 ) = ^( 2 ,\) = 3 ^( 2 , 1 ) = {0)1}- Without loss of generality, we assume 
that = 3 ^ 1 , 1 ) = X( 2 , 2 ) = 3 ^( 2 , 2 ) = {0}- The edge partition is denoted by (12i, O 2 , U 3 ) where 


(3) 


is a 


Si! = {(1,2)}, Sl 2 = {(2,1)} and U 3 = {(1,1), (2,2)}. Since 3>(i,i) = ^ 2 , 2 ) = {0}, 
trivial channel which can cany no information. The other two channels of the BSC-IF are denoted by 
qy) | Y = qyy 2) \x {12 anc ^ ^Yn | v,. = lyy i,\x (2 l; rcs P cct 'vcly, which coiTespond to the BSC and the 
DMC in Figure |T}b) respectively. ■ 

Definition 3: An edge-delay profile, also called delay profile for simplicity, is an N 3 -dimensional tuple 
'■ (^, i,j) G X 3 ) where G {0,1} represents the amount of delay incurred by edge (£, i) on 

edge (i, j). The delay profile is said to be positive if it is the all-one tuple 1 = (1,1,..., 1). 

Under the classical model, only the positive delay profile is considered, i.e., every edge incurs a unit 
delay on all the edges. In our model, some elements of the delay profile can be zero, which indicates 
that some edge can incur zero delay on some other edges. However, if the delay profile contains too 
many zeros, deadlock loops may occur which result in each node waiting for the other nodes to transmit 
first before encoding and transmitting its outgoing symbols. The following two definitions enable us to 
formally define “good” delay profiles that do not induce deadlock loops. 


Definition 4: A channel operation sequence for the discrete network (ft};. 34’- a, 12. q) is some permu¬ 
tation of the tuple (1,2,..., a). The set of all channel operation sequences (permutations of (1,2,..., a)) 
is denoted by n. 

When we formally define a code on the discrete network later, a channel operation sequence (- 7 r(l), vr(2), 
..., 7 r(a)) 6 n together with a delay profile B = (brgjj) ■ G X 3 ) will be associated with the 
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code, where the a channels are operated in the order 

/jM 1 )) flM 2 )) rJ ( 7r («)) 

h 5 y ? • • • ? 

and b^j'j represents the amount of delay incurred by edge (£, i) on edge (i, j). If = 0, then node i 

receives Yy^ k before encoding Xojy k , and we say (£,i) incurs zero delay on if = 1, then 

node i receives after encoding Xujy k , and we say {£. i) incurs a delay on (i. j). Under the 

classical model, a = 1 and the channel operation sequence can only be the tuple (1) (because there 
is only one channel), which implies that B can only be positive, meaning that the amount of delay 
incurred between every pair of edges can only be positive. In contrast, under our edge-delay model, the 
a channels can be operated in different orders and some elements of B can take 0 as long as deadlock 
loops do not occur (in a deadlock loop, every node wants to receive first and then encode, and hence no 
transmission can take place). Therefore our model is a generalization of the classical model. The essence 
of the following definition is to characterize delay profiles which will not induce deadlock loops in the 
network. 

Definition 5: Let (ft ’g,yg,a,rt,q) be a discrete network, and let 7r = (7r(l),7r(2),...,7r(a)) be a 
channel operation sequence. For each (i,j) E £, let t(yj) be the unique integer such that (i,j) E p 

Then, an edge-delay profile (b(£jj\ : (£, i,j ) E X 3 ) is said to be feasible with respect to tv if the following 
holds for each (. i,i,j ) E X 3 : If = 0, then tup < tyjy 

Under the classical model, the only possible delay profile is the positive delay profile, which is 
feasible with respect to any channel operation sequence according to Definition [5] The following example 
illustrates a delay profile for the BSC-IF which is feasible with respect to one channel operation sequence 
but not to another channel operation sequence. 

Example 2: For the BSC-IF as defined in Example [l] a = 3 because it is characterized by three 
channels. Since the third channel is trivial, we assume without loss of generality that it is always the 
last channel to be operated. Then, the two channel operation sequences of our interest are (1,2,3) and 
(2,1,3). Let 

fo if (£,i,j) = (1,2,1), 

Vxi) — t 

I 1 otherwise 

for all ( £,i,j ) E X 3 , and let B(i 2 .\) — {b(l,i,j) : (■£,*, j) £ X 3 ) be the delay profile where only &(i j2 ,i) is 0. 
Then, it follows from Definition [5j that X?(i 2 ,i) is feasible with respect to the channel operation sequence 
(1, 2,3) but B^ 2j i) is not feasible with respect to (2,1,3). An interpretation for the feasibility of If l2 . i j 
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is given as follows: If the channel is operated according to the sequence (1,2,3), i.e., q Y ^ | V is 
operated before , then Y( 12 ) will be generated before the activation of q Y \ X{21) - Therefore, 

node 2 can receive Yr 12 ) before encoding Xmy), and we say B (i 2 p) is feasible with respect to (1, 2, 3). 
If the channel is operated according to the sequence (2,1,3), then node 2 will receive Ym 2 ) after the 
encoding of X( 2 ,i)> and we say B( 12 ,i) is not feasible with respect to (2,1,3). 

We are ready to define codes that use the network n times. 

Definition 6: Let {Xg, Yg, a, Cl, q) be a discrete network. Let (V, V) be the multicast demand for the 
network. In addition, let 7r G II be a channel operation sequence, and let B = (fyqjj) : (L i,j) G X 3 ) 
be a feasible delay profile with respect to 7r. A (-7T, B, n, Mx)-code, where Mj — ( M \, M 2 ,..., M/y), 
for n uses of the network consists of the following: 

1) A message set Wj = {1,2,..., Mj} at node i for each i G I, where Mi = 1 for each i G V c . 

Message flq is uniformly distributed on VV,. 

2) An encoding function 


fa#* ■■ w >< jfcr* x 


-yk bf JV,, 

X y (N,i) 


X, 


(hi) 


for each (i,j) G £ and each k G {1,2,... ,n}, where f(ij) y k i s the encoding function for Xuj^ k 
at node i in the k th time slot such that X^jy k = /(i,j),fc(W*> Xu i)^’'^ '■ ^ *= X)- 

3) A decoding function fjij : Wj x y ix{j} W, for each (i, j) G V x 22, where is the decoding 
function for W t at node j such that Wij — 9i,j(WjXxx{j}( 

Given a (n, B,n, Mx)-code, it follows from Definition [h] that for each ( l,i,j ) G X 3 , edge (£, i) incurs 
a delay on edge (i,j) if \e,ij) = 1- If &( e,i,j) = 0, edge (£, i) incurs zero delay on edge (i,j), i.e., for 
each k G {1, 2,..., n}, node i receives Y^^ k before encoding X^^ k . The feasibility condition of B in 
Definition [5] ensures that the operations of any (tt, /i, n. A2x)-codc are well-defined for the subsequently 
defined discrete memoryless network; the associated coding scheme is described after the network is 
defined. 

Definition 7: A discrete network (Xg, 34’- Cl, q), when used multiple times, is called a discrete 
memoryless multimessage multicast network (DM-MMN) if the following holds for any channel operation 
sequence 7r = (7r(l), 7r(2),..., 7r(a)) and any ( 77 , B, n, Mj)-code: 

Let U k ~ ] = (Wj, Xf 1 , Yjk'~ l ) be the collection of random variables that are generated before the A: th 
time slot. To simplify notation, let 

h 

X 4 u n w{m) . (9) 

m= 1 
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Then, for each k G {1,2,..., ri} and each h E {1,2,, a}, 


Pr {U k 1 = u k = xq h ik ,YQh tk = y^,k] 


k -1 


= Pr {U k 1 = u k l ,Xdh ik = xnh, k ,Y Q h-i k = yni-\k}^Y^^ ) \x a ^^y^wA x ^(h),k) 


k -1 


Or {h)) 


( 10 ) 


for all u k ~ 1 , XQh k and y^h k . 

Following the notation in Definition [7J consider any (n, B,n, Mx)-code on the DM-MMN. In the A: th 
time slot, Xg jk and Yg jk are generated in the order 


by transmitting on the channels in this order qW 1 )), gM 2 )),..., gM“)) using the (7r, £?, n, Mj)-code (as 
prescribed in Definition |6j). Specifically, X^ (hh k and channel q( n ( h )) together define Yn w(hhk for each 
h E {1,2,..., a}. In addition, the feasibility condition of B in Definition [5] ensures that the encoding of 
XfL w(h ^ k specified in Definition |6] is well-defined because X^ (hhk depends on only the symbols generated 
before its encoding, which is formally shown in the following proposition. 

Proposition 1: Let 7r = (7r(l), 7 t( 2), ..., vr(o:)) be a channel operation sequence. Fix any (it, B, n, Mj)- 
code and fix an h G {1,2,..., a}. Then, for each (i,j) G X^^ k is a function of (Wi, Yjtfh-,, 

Y (xx{i})n^-\k) for each ke{l,2,..., n}. 

Proof: Let B = : ( l,i,j) G X 3 ) be a delay profile that is feasible with respect to tt and 

fix a (tt, B, n, Mj)-code. Fix a k G {1,2,..., n} and an edge (i,j) G By Definition |hj X^^ k 

is a function of (Wi, (Y^ '■ £ G X)). Therefore, it suffices to show that (Y^ i ^ t ’'’ 3) : £ G X) is 

a function of P(£ X {q.) n n£-i &)• Since each is binary, it remains to prove the following 

statement: For each £ such that 6^^ = 0, Y ( j q k is a function of 5qxx{i})no'*- 1 k )• T° this end, we fix 
an I that satisfies lyj, j :i = 0. Let tao and be the two unique integers such that (£,%) G ) and 

(z, j) G ..q. It then follows from Definition |5j that which implies that (£,i) G 

which then implies that q k is a function of Yp; x {i}) n n h - 1 &)• ■ 

Under the classical model, a = 1 (there is only one channel) and Xg k and Yg k are generated in the 
order 

Xg ik , Yg )k 

in each time slot k. The only channel operation sequence is (1) and it follows from Definition [5] the 
only feasible delay profile with respect to (1) is the positive delay profile, which implies that every edge 
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incurs a unit delay on all the edges. Consequently, the classical model for the DM-MMN is a special 
case of our edge-delay model when a = 1. We arc ready to define the capacity region of the DM-MMN 
using the following three definitions. 

Definition 8: For a (7T, B. n, Mj)- code, let 

K* 4 Pr{ U ( jj )eVx x> {Wij + Wi}} (11) 

be the probability of decoding error. 

Definition 9: Let 7r be a channel operation sequence and B be a feasible delay profile with respect 
to 7T. A rate tuple (R \. ID ,.... Rn), denoted by Rx, is (n. B) -achievable for the DM-MMN if there 
exists a sequence of (-7T, B, n, M%)- codes such that lirninf logMi > Ri for each i G X and lim Pfi rr = 0. 
A rate tuple is said to be n-achievable if it is (n. £>)-achievable for some B. A rate tuple is said to be 
achievable if it is 7r-achievable for some 7r. 

Without loss of generality, we assume that M, = 1 and R, = 0 for all i € V c in the rest of this paper. 
Definition 10: The (n, B)-capacity region, denoted by CJ,, of the DM-MMN is the set consisting of 
every (7r, B) -achievable rate tuple. The it- capacity region C n is defined as 

C’ = U C ’b ' 

B:B is feasible with 
respect to 7r 

The capacity region C is defined as 

C= [J C”. 

Tren 

The following theorem is our main result. 

Theorem 1: Let (Xs, ys, a, fi, q) be a DM-MMN with independent channels and zero-delay edges 
under multicast demand (V, V), and let 

ll out 4 \J p| | R x G I .E R i ^ L P*S .v e (■ X TxX ; YlxT' I X T c xl) | (12) 

Px £ ,y £ --Px £ ,y £ = TCl-.T^nV^V) 

be the classical cut-set bound for the DM-MMN with independent channels. Then for any channel 
operation sequence tv, we have 

c* c P out . 
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Hence, 


c c n out 


Remark 1: Theorem [T] subsumes the classical cut-set bound for a = 1, i.e., 


ecu n 

Px £ TCX:T»n2V0 


t>N 


V Ri< I (1) 


{Xtxx\ YzxT a \Xr c xZ 


Remark 2: Recall that the BSC-IF can be viewed as a multicast network as explained in Section |Tj 
Using the formulation of BSC-IF in Examples [T] and [2] we obtain by Theorem [T] that R\ is bounded 
above by the capacity of the BSC and R 2 is bounded above by the capacity of the DMC regardless of 
what order the two channels are operated. 

Remark 3: The cut-set bound in Theorem [T] is tight for the MMN consisting of independent DMCs 
|6}. This will be shown in Section |V| after the proof of Theorem |T] is presented in the next section. 

Remark 4: The cut-set bound in Theorem [I] can easily be generalized for (i) any multiple multicast 
demand where each source multicasts a single message and the destinations want to decode different 
subsets of the messages; (ii) any multiple unicast demand where each node transmits N — 1 independent 
messages to the other N — 1 nodes and each message is decoded by one node only. 


IV. Proof of Theorem Q] 

A. Using Fano’s Inequality to Bound Sum-Rates 

Fix a channel operation sequence tv. Let Rj be a 7r-achievable rate tuple for the DM-MMN denoted 
by (Xs, y£, a, ft, q). By Definitions [9] and 10 there exists a sequence of (tv, B, n, Mj)-codes such that 


n —>00 TL 


(13) 


for each i e X and 

lim Per t = 0. 

(14) 

Since 7r is a permutation of (1,2,.. 

n—too 

., a) by Definition [4] and 

= £ by Definition [TJ it follows 

that 

uj 

II 

-c 

Cf 

11 

Cl 

(15) 


h=l h— 1 


Fix n and the corresponding (tv, B, n, Mj)- code, and let p Wx x ,, Y << w> x t ^ ie probability distribution 
induced by the code. Fix any TCI such that T c n V / 0, and let d denote a node in T c n V. For the 
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(7T, B, n, Mi)- code, since the N messages W\ , W 2 ,. .., Wn are independent, we have 
^logM* = I PWx , Y n(W T] Y£ xTc \W T c) + H PWi ^(W t \Y^ xT .,W t .) 

i&T 

— 7 VW X ,Yf (Wt; 1? xT c|Wt=) + H PWi<Yi (W T \Y? x{d} , W d ) 

< W £ 4Wr;i? X Tc|WTc) + l + Pj^logM, (16) 

iST 

where the last inequality follows from Fano’s inequality (cf. Definition [8}. 

B. Using Discrete Memoryless Property to Simplify the Upper Bound 

Following ( fT6] ) and omitting the subscripts for the entropy and mutual information terms, we consider 

I(W T -,Y£ xT o\W T c) 

n 

= ^2(H(Y XxT o jk \ W T c, y £~^ c ) - H(Y XxTBtk \W x ,Y££ e )) 

k= 1 
n 

= ^(P(F (XxTo)na „ )i: |FF T c,y x fe -^) - H{Y { i xT . )nn ^ k \Wi,Y^)) 
k= 1 
n 

= y~! y~! ^(xxT c ) n n£- 1 ,fc) 

fe=i h:n T(h) n(XxT c )^0 

- ^’(^(xxT“)nn„ (h ),fcl^X)^xxr c ’'^(xxT c )nnir 1 ,fe)) (I 7 ) 

where (a) follows from Q and ( [15] ). Following (17] ), we consider 

^(XxTOn^ w , fe |WT=,^ x ^,y (XxTc)nn ,-i !i; ) 

- ^(^(x x rq n n„ w ,k I Wr«, ^ XxXc , Y' (XxT c )nn h-i , X (T c xX )nn„ (M ,fc) 

< H{Y( XxT o^ n n^ (h) ,k\ x {T^xi)nn, (hh k) (18) 

and 

H{Y{Xx.T c )nQ. iy(th) ,k\W^x, ^rxT<=’ ^(Xxrqnnirhfc) 

— -(7(y(xxTqnn w(fe ),fe|l^x, ^x x t°) ^(xxT^nn^- 1 ,*) ^Trw.fe) 

= ^(Y(xxTc)nn, (h)) fc|^n. ((0> fc) (19) 

for each k £ {1,2,..., n} and each h that satisfies fl(Ix T c ) / 0, where 
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(a) follow from the fact by Proposition |lj that X^ T ^xi)rn„ {h ),k i s a function of (Wr°, Pj x y c , ^ / (xxT o )nnlr 1 fc) 

(b) is due to the following Markov chain implied by < [TQ| ) in Definition [7| 

((Wx, iixT'i ^(XxT c ) n ni _1 ,fc) ^ ^ 

Combining < [T6| ), ( [17] ), ( fT8| ) and < p~9| > and using the fact that 7r is a permutation of (1,2,..., a), we obtain 

(l-O^^iogM, 

«eT 

1 n 

IE E (#(^(zxT3nfi fc ,fc|2((T°xX)nn h ,fc) _ #(^(XxT E )nfi h ,fcl-^n,,,fc)) (20) 

fc=i /i : n fe n(XxT=)^0 
j n a 

^EE (^(^(Xxr c )nQ h ,fcl^(T c xX)nn h ,fc) ~ #(^(XxT3nfi h ,fc|2(r2 h ,fc))- 


< - 
n 


< - 
n 


( 21 ) 


k =1 h —1 


C. Introducing Time-Sharing Distribution to Single-Letterize Entropy Terms 

Recalling that p Wz X n Y " w, L ' s t * ie probability distribution induced by the code, we define the time- 

1 


sharing distribution 


*■<*> * - 


( 22 ) 


for all k 6 {1, 2,..., n}. In addition, we define 


SQ n ,x £ , Qn ,Y £ ' Qn (k : x e ,y £ ) = PQA k )Px E , k ,Y £ , k {x£,y£) (23) 

for all k € {1,2,... ,n}, G <3^ and y £ G 32^:- Following ( |2T) ), we consider the following chain of 
inequalities for each h € {1,2,... ,a}: 


1 

n 


n 

E (H Px? y y (Y(z X T°)nn h ,k\X(T° xx)n n h ,k) ~ H Px n iY n(Y(z xT c)nn h ,k\ x n h ,k)) 

k= 1 


(a) ^ 
n 


fc=i 


S Qn 


'£,Qn ’ Y £,Qn 


(5 / ’(XxT‘=)nf2 h ,Q„|2^(T=xX)nf2 h ,g„) Qn 


k) 


- Hs Q n , Xe ,Q n *e, Qn ( iA (XxT‘=)nO h ,Q„|2^n h ,Q„, Qn — k)) 

j22} 

~ ^ S Qn,X £ Q ri ,y r £ Q n ^y^{XxT c )r\Q.hiQn \X(T c XZ)nflh,Qn ’ 

- Hs Qn,x £tQn ,Y EtQn {Y(zxTc)r\n h ,QjXn h ,Q n ,Qn) 

— ^ s Qn> x £,Q n . Y £ ,Q n (^?xxr c )nf2h,Q„ |2f(T c xXjnrtfuQn) 

_ Hs Qn,*£,Q n ,Y £tQn { Y (TxT<=)r\n h ,Q n \ X £l h ,Q n ,Qn) 
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®H. 


S Qn ,X £ Q n ,Y £ q,' 


(Y(xxT-)nn h ,QjX(T<=xX)nn h ,Q n ) 


- II. 


SQ n ,X £ Q n ,Y £ Q r 


(X(XxT-)nn h ,Q n I x n h ,Q n ) 


= I. 


Sx n h ,Qji' Y n h ,Q n ( X (TxI)nCl h ,Q n i ^(. XxT c )nU h ,Q n \ X (T c xI)nU h ,Q rl ) 


(24) 


where 

(a) is due to the following fact by (|23j): 


8 X e . Qn ,Ye,Q n \Q n ( x e,ye\k) = PXs, k ,YeAx£,ye) 


for each k, X£ and y£. 

(b) is due to the following Markov chain implied by 


(Qn x n h ,Q n ->• y(xxT-)nn h ,Q n ) s ■ 


Combining ( |2T| ) and (241, we obtain 

{i-p: n )Y J ^o g M i <Y J i sx 


n h ,Q„Xn h ,Q, 


{ x (TxX)nn h ,Q n ',Y(XxTc)nn h ,Qj x (T<=xT)nn h ,Q n ) (25) 


ieT h=X 

where for each h & {1,2,, a}, sx Qh Qn ,Y n q„ is a distribution on {Xsi hi yQ. h ) that satisfies 

h ,q„ , Ya h ,Q n {xfl h ) UXXh ) 

ED i 


^ y PXn h ,k,Y$u, .k (.Xflh i yxih ) 


Go} 1 


fc=i 

n 


n 


2 JI®nj) 


fc=l 


n 




(26) 


k =1 


Let {n^}^ be a subsequence of W^t such that Q„ ,dh h Q converges with respect to the C\- 
distance for all h £ {1,2,, a}, and we define the limit of sx a ,, n ,y„ Qn as 


s x a . ,Y n . {xn h ,yn J = lim sx n , , Q „, ,y n . , Q „, (ato h , ) 


£—>■00 


(27) 


for all (xQ h ,yn h ) e Ah,, x ^ and all h € {1, 2,..., a}. Since 


I s x n , , v ri , (2f(TxX)ntt/i 5 y(XxT c )n£ih 12^-(T c xX)nt2h) 
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is a continuous functional of sx nh ,Yn h f° r eac h h € { 1,2,... ,a}, we obtain from (25 I, ( [27] ), ( |T3| ) and 
( fT4| ) that 


T.Ri - y ^Xn.,y n| ( X (TxT)m h i Y (XxT<=)nn h \X(T°xpnn h )- (28) 

iET h =1 


Since 


s A n , ,y Q) , - SAT; 


(h) 

> q Yn h \X a 


for all h E {1,2,..., a} by ([26]) and ( [27] ), it follows from ( [28] ) that 

Q 

y R i< y, I s Xa gW lx (2^(TxX)ntt fc ; ^(ixT'lnsik I-^(t= xX)nn h ) • 

h I* S2), 


i£T 


h =1 


(29) 


Define 


Q! 



Jl=l 


It then follows from (291 that 


a 

y / R i < y ) Ip XF , Y £X(Txi)na h ',Y(XxTe)na h \ x (Tcxi)r\n h )- 

i£T h =1 


In order to simplify ([32]), we consider 


(30) 

(31) 


(32) 


Ip x £ ,y £ { X TxIl YixTc\Xtcxi) 
a 

~ y, Ipx £ , Y£ ( x TxX ; Y(XxT“)nQ h I x T°xi, Y(TxT<=) rn h ~ 1 ) 
h= 1 
a 

= yXHp.xr.Yr (X( lxT o)nn h \XT <=xz, YjixT-) nn h ~ 1 ) ~ ^px £ ,x £ ( 5/ (XxT‘=)no fe |2ff,3 / (x X T‘=)nn'‘- 1 )) 

/i=i 

a 

= y\Hpxr,Yr (Y(xxT-)nn h \X(T-xpnn h ) ~ r px £ ,y £ (Y(xxT-) nnj2Q:nfiJ) 

/).=i 

Q! 

= yipx £ ,Y £ (X(TxX)nft h 'i Y (ixT-)nn h \X^T-xi)ra h ) (33) 

/i=i 
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where (a) follows from the fact by ( |3T[ ) that (Xf i e ,Yn e ) and (Xq m ,Th m ) are independent for all ^ rri 
under the distribution px e ,Y e • Combining ([32]) and ([33]), we obtain 


^ Ri< Ipx £ ,Y £ i x Txl] izxT' |^T'-xl)- 
i&T 


(34) 


Since Px £ ,y £ satisfies ( |3Tj ) and depends on only the sequence of (7 r, B , n, Mj) -codes but not on T, ( |34| ) 
holds for all T C X that satisfies T c D V / 0, which implies from ( |3T| ) and ( fT2| ) that Rj € 77 out . This 
completes the proof. 


V. Multicast Network Consisting of Independent DMCs 

The MMN consisting of independent DMCs [6] consists of N 2 channels where each channel is 
associated with a directed edge. Define 


^N(i—l)+j — {(GJ)} 


(35) 


for each (i,j) £lxl such that q v 4 J ! y characterizes the DMC associated with edge (i. j). 

1 l An JV(i_0+3 

Then, this network can be viewed as a discrete network (Xs,ys, N 2 ,fl, q) according to Definition [ 2 } 
where ft and q arc defined as 

ft = (Di, fl2 ,..., fix 2 ) 


and 




q (2) ,...,q 


(N 2 )^ 


respectively. We assume that this network satisfies the discrete memory less property stated in Definition [7] 
Let C denote the capacity region of this network. To simplify notation, we let 


„ A (N(i-l)-h?) 


JV(i-l)+3' si JV(i-l)+j 


and let 


= max /, 


pa 


0 , 3 ) 




(36) 


(37) 


be the capacity of channel q Y(i ,, for each (i,j) £ Ixl Combining existing results with Theorem [TJ 
we fully characterize the capacity region of the MMN with independent DMCs and zero-delay edges in 
the following theorem. 

Theorem 2: For the MMN consisting of independent DMCs with zero-delay edges under multicast 
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demand (V, V), we have 


c=c 1 = p| [rx 

TCZ:T<=C\V ^0 


t>N 


ERi< E Caj) 

i&T (■ i,j)eTxT <= 


}' 


where C and C 1 denote the capacity region and the 1-capacity region respectively. 


Remark 5: It was shown by Effros [ 101 that under the positive-delay assumption in the classical setting, 
the set of achievable rate tuples for the MMN with independent DMCs does not depend on the amount 
of positive delay incurred by each edge on each other edge. Therefore, Theorem [2] together with Effros’s 
result implies that the set of achievable rate tuples for the MMN with independent DMCs does not 
depend on the amount of delay incurred by each edge on each other edge even when zero-delay edges 
are present. 

The achievability and converse proofs of Theorem [2] are presented in the next two subsections respec¬ 
tively. 


A. Achievability 
Define 

Our goal is to prove 


n 1 = p| {Rx g 

TCX:T<=nX>^0 


t>N 


E;st E | _ 


(i,j)eTxT c 


C DC 1 D7Z 1 


(38) 


(39) 


The proof combines existing results from network equivalence theory in {6jj and noisy network coding 
(NNC) in Q, @. Consider a deterministic counterpart of the MMN consisting of independent DMCs by 
replacing every DMC qy j} \x^ (J in the MMN with a noiseless bit pipe whose capacity is equal to 
(cf. ( [35] ), ( [36] ) and ([37])), and let Cj et be the capacity region of the deterministic counterpart network with 
unit-delay edges where 1 denotes the all-one delay profile. Letting C 1 denote the 1-capacity region of 
the MMN consisting of independent DMCs, we conclude by using the network equivalence theory |6 j 
that C 1 = C c } ct , which then implies from Definition 


10 


that 


c D c 1 = Ci v 


(40) 


By viewing 1Z 1 as the cut-set bound for the deterministic counterpart network, it follows from the NNC 
inner bound in [8, Sec. II-A] that C^ et D 1Z 1 , which then implies from (40> that (391 holds. 
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B. Converse 
We will prove 


C c n 1 


(41) 


by using Theorem [T] Suppose Rx is achievable. It then follows from Theorem [TJ 
exists a distribution 

N N 

PXeXe 

i=1 3 = 1 

such that for all T C X that satisfies T c n D / 0, we have 

^ Rj < Ipx e ,Y £ (XtxI', YixT<=\Xt^xi)- 

i£T 

Consider the following chain of inequalities for each TCI: 


and (36) that there 


(42) 


(43) 


Ipx £ ,Y E ( Xtxi ; YxxtAXt^xx) 

N 

= El Ipx £ ,Y £ ( X TxX', x T c I Xt° xl -i ^ {i'eX|i'<i}xT c 


i— 1 
N 


^ 1 y 1 Ipx £ ,Y £ {.YtxIi Y(iJ)\XT c xZi ^{i'GX|i'<j}xT c i ^{i}x{j'eT 0 b''<j}) 
*=1 j&T<= 


N 


^EE ( Hpx E ,Y £ 0^(i,j) \XT c xl) Hpx £ ,Y £ 0 (id) |-^£> ^ (i'eX|i'<i}xT c > ~^{i}x{j , eT°\j'<j})) 

i= 1fST c 


(a) 


IV 


(b) 


EE ( Hpx e ,Y £ 0^\i,j)\XT^xx) Hpx £ ,Y £ l^(i,j))) 

i=l j£T c 

EE ( Hp X£t Y £ \Xt°xx) Hpx £ ,Y £ |X(jj))) 

iST j&T° 

<EE dw,(r<«>) - »>■«,.>* ( y «)l*<«>>) 

iST jeT c 


EE'. 




(Em)’ Ed)) 


ter jeT c 


P4 




i&T j&T<= 


? E c (u) , 


(44) 


(i,j)£TxT° 


where 
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(a) follows from the fact by ( [42] ) that 

((^£\{(ij)}: ^£\{(ij)}) “4► X(ij) —> Y(i,j)) p 

forms a Markov chain for all (i,j) G £. 

(b) follows from the fact that for all (i,j) G T c x T c , 


Hpx e , y £ ( X{i,j ) I Xt<= xx) < Hpx £ , Pfa) I ) • 

Combining ( |43| ), (441 and (38i, we have R% G 77 1 . This completes the proof of ©• 


VI. Multicast Network Consisting of Independent AWGN Channels 


In this section, we generalize the result in the previous section to multicast networks consisting of 
independent additive white Gaussian noise (AWGN) channels with zero-delay edges. The MMN consisting 
of independent AWGN channels consists of N 2 channels, where the channel associated with edge (i. j) is 
an AWGN channel whose noise variance is denoted by o ^ ; j for each (i. j) e £. In each time slot k, node i 
transmits k G M on edge (i,j) and receives Yn j: , k G M from edge (£,i). In addition, we assume 
that the channel outputs are independent given their inputs, i.e., the channel noises are independent. Each 
codeword transmitted on (i. j) is subject to the power constraint 

p 'ht I kA f M }= 1 («> 


for each (i. j) G £, where denotes the average power available to edge Similarly, each node i 
is subject to the power constraint 


1 


N n 


Pr 


(46) 


j= 1 k= 1 

for each i G X, where P r denotes the average power available to node i. The network is subject to the 
power constraint 


Pr UEEEV«>,«G"- p } = 1 - 


N N n 


(47) 


i=l j= 1 k =1 


where P denotes the total average power available for transmissions in the network. To facilitate discus¬ 
sion, we write Ps = (P(ij) ■ (i, j) G £) and Pj = (Pi : i G X). 
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A. Network Model 

Definition 11: The MMN with AWGN channels, denoted by a positive-valued tuple (<r 2 ^ : 
consists of N 2 AWGN channels denoted by {qy (i ,,|x (l : (h j) G T x Z}, where qy characterizes 

the AWGN channel associated with the directed edge (i. j) such that 

for all <5 M 2 . 

In Definition |TT] we have not introduced the independence property among the channels yet. We will 
define the independence among the AWGN channels after we have specified codes that use the network 
multiple times. To facilitate discussion, we define 




(49) 


for each (i. j) G Z x Z such that = £ and F l £ n Dfj = 0 for all t f. In other words, 

(fii, Q 2 , ■ ■ ■, (4/v 2 ) is an ./V 2 -partition of £. As in Definition |4} we define a channel operation sequence 7r 
to be a permutation of (1,2,..., N 2 ), and define II to be the set of the permutations of (1,2,..., N 2 ). 
For a channel operation sequence 7 r and each delay profile B that is feasible with respect to tv (cf. 
Definition [5}, we define a (ir, B, n, Mi, Pe,Px, P)-code similar to Definition [6] with the additional power 
constraints ( |46| ) and ( [47] ). We are ready to formally define the MMN with independent AWGN 
channels. 


Definition 12: A MMN with AWGN channels (a 2 ^ : (i,j) G £) is called a MMN with independent 
AWGN channels if the following holds for any channel operation sequence 7r = (7 t(1),7t(2), ..., tt(N 2 )) 
and any (7T, B, n, Mx, Pg, Pj, P)-code: 

Let U k ~ 1 = (Wx, Xg -1 , Yg~ 1 ) be the collection of random variables that are generated before the A: th 
time slot. To simplify notation, let 

h 

ni ± J n n(m) , (50) 

m =1 

and let (ih,jh) be the unique edge such that 


^7r(/i) — {_idh, jh)}■ 


(51) 
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Then, for each k G {1, 2,..., n} and each h e {1,2,, a}, we have 


where 


PU k ~ 1 ,X n h tk ,Y n h k ~ Puk -^ X ^^ Y ^-\ k PY ^ h) AX^ {h) ^ 
™ PU k ~\X nh Y nh 1 PY, ih , jhhk \X (ihJhhl 


P y (i h J h '),k\Xu h ,j h ),k(y(ih,jh),k\ X ('i'h,jh),k) QY (ih , jh) \X { i hJh) (y(i h ,j h ),k\ x (i h ,j h ),k) 


(52) 

(53) 

(54) 


for all {x(i hjjh ) tk ,y( ihtjh ) t k) G (channel q Y(ihtjh) \x (ih , ih) was defined in <|48])). 

Given a multicast demand (V, V), we define the {tt, B)-achievability, the it -achievability and the 
achievability of a rate tuple Rx as in Definition |9j Then, we define the {tt. B)-capacity region denoted 
by CJj, the tt- capacity region denoted by C n and the capacity region 


C= \J C 7 


(55) 


7rGlI 


as in Definition 10 The following theorem fully characterizes the capacity region of the MMN with 


independent AWGNs and zero-delay edges. The achievability and converse proofs of the theorem will 
be provided in the following two subsections respectively. 

Theorem 3: Let (cr 2 ; ^ : (i, j) G £) be a MMN with independent AWGNs and zero-delay edges under 
multicast demand {V,V), and let 


(56) 


be a set of iV 2 -dimensional tuples which specify the power allocation for the edges in the network. Let 
Sg denote (S( U ), 5 (i >2 ), • • •, S^tv)) and define 


S{P £ ,P X ,P)±< 

(^(1,1)) • 

• > S(n,n)) G 

Ef=i Ef=i s (iJ) < p, j 

Ef=i S(i,j) < Pi for all tel, 




S(i,j) < P(i,j) for all (i,j) € £ ) 


^cut-set = |J fl <! R T £ 

S £ eS(P £ ,P x ,P) TCi:T<=nv^m 


»N 


E \ lo g ( 1 


i£T 


(■ i,j)€TxT- 


Sa.n 


(57) 


to be the classical cut-set bound. Then, we have 

C = C 1 = 77 cu t- S et, 

where C and C 1 denote the capacity region and the 1-capacity region respectively. 

Remark 6: Theorem [3] generalizes the capacity result in Theorem [2] to the MMN consisting of in- 
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dependent AWGNs with zero-delay edges. Although the extra power constraints in (451, (46 1 and ( [47] ) 
are introduced in the AWGN channels setting compared with the DMCs setting, we can still show in 
Theorem [3] that the set of achievable rate tuples for the MMN with independent AWGNs does not depend 
on the amount of delay incurred by each edge on each other edge even when zero-delay edges are present. 


B. Achievability 

Our goal is to prove 


C 4 C 1 D K cut _ set . 


(58) 


Since C D C 1 by 
by ((57): 

C 1 D p| {R Z € 

TCl:T°nT>^0 


, it suffices to show C 1 D 7Z C ut-set, which is equivalent to the following statement 


(JV 


E R i < E 2 lo S ( 1 + 

ie T ( i,j)£TxT c 




(59) 


holds for all Sg E S(Pg,Pz, P) and all 5 > 0. In order to show (59 1 , we fix an Sg E A( Pg. Pz, P) and 
a 6 > 0. Suppose we use a random Gaussian codebook with power S^ ^ — <5 for each edge (i,j) E 8 so 


that the rate 


-log 1 + 




(7 


(60) 


(*d) 


can be achieved for the AWGN q Y[ . |_x- (i (defined in ( |48) ) as n tends to infinity. In addition, the power 
constraints ( |45) , ( [46] ) and ( [47] ) hold with probability approaching 1 by the weak law of large numbers 
due to the random Gaussian codebooks and the fact that Sg E S(Pg,Pz, P) (cf. ([56)). In the rest of the 
proof, we follow the network equivalence and NNC arguments in Section |V-A| which have been used for 
the achievability proof of Theorem [2] 

Consider a deterministic counterpart of the MMN consisting of independent AWGNs by replacing 
every AWGN q Y(i \x >,, with a noiseless bit pipe whose capacity is equal to C(,S'^ ? . — 5) (cf. ©), 
and let C c j et be the capacity region of the deterministic counterpart network with unit-delay edges where 
1 denotes the all-one delay profile. Letting C 1 denote the 1-capacity region of the MMN consisting of 
independent AWGNs, we conclude by using the network equivalence theory |6] that 


i (a) i 

C 1 D Cl ct, 


(61) 


where (a) is not an equality because the capacity of every AWGN q Y( ( is C(,S’ f 7 ■.) rather than 
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C(S* J . ) - 5). Define 


n 1 = 


n { r i 

TCX:T c nX>^0 


tN 


E_Ri< £ c(^..)-<5) 


iST 


(' i,j)eTxT- 


(62) 


to be the cut-set outer bound for the deterministic counterpart network. It follows from the NNC inner 
bound in [8, Sec. II-A] that Cj et D 72 1 , which implies from ( |61[ ) that C 1 D 72 1 holds, which then implies 
from ( [62] ) and ( |60j ) that ( |59| ) holds. Since < [59| ) holds for all S£ G S(Pg, Pi, P) and all 5 > 0, ( f58| ) also 
holds. 


C. Converse 

Our goal is to prove 


CCK, 


— 7 '-'cut-set* 


(63) 


It suffices to show that 


C n C 72, 


_ /v cut-set 


(64) 


holds for all channel operation sequence tv, which will then imply from ( [53] ) that ( [63] ) holds. Fix a 


channel operation sequence 7r. In order to show ( [64] ), we fix an arbitrary delay profile B that is feasible 
with respect to 7r and let R,x be a (n, B )-achievable rate tuple. By definition, there exists a sequence of 
(tv, B, n, Mx, Pg, Pi, P)-codes such that 


for each i £ X and 


^i>R, 

n—>oo 71 


lim PI. = 0. 


(65) 


(66) 


where P e ”. is as defined in GD- Fix n and the corresponding (tv, B, n, Mi, Pg, Pi, P)-code, and let 
Pw x x n Y n w- L 6c the probability distribution induced by the code. Fix any TCI such that T c n 
V ^ 0. Following similar procedures for deriving ( [20] ) in Section IV we obtain from Fano’s inequality, 
Proposition [T] and ([52]) that 


(1-O^^logM, 


i&T 


1 


< X] {h PX 2, Y p(Y(ixT-)nn h ,k\X( T cxi)na h ,k) - h Px „ YS .(Y( XxT c)nn h ,k\Xn h ,k))- (67) 

fc=l h:Q h £lxT c 
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Following ( [67] ), we consider the following chain of equalities for each k G {1, 2,..., n}: 

X {hp x g, Y g.(Y(ixT-)nn h ,k\X(Tc x z)na. h ,k) ~ ^ (Y(x X T^nn h ,k\ x n h ,k)) 

h.:n h £lxT° 

<=) X (^PJfg,y»( y (XxT<=)n{(i,i)},A:l- X '(T<=xI)n{(*j)},fc) _ h Px „ Y „ (y(XxT=)n{(i,j)},fel^(iJ),fc)) 
(j,i)eXxT= 

- X] ( h Px2,r? ( Y (IxT‘)n{(i,j)},k) ~ h P x^, Y ^(X(XxT-)n{(i,j)},k\ X (iJ),k)] 

('i,i)eTxT- 

— y ^ {p'Px™,^"- ( X (i,j),k) — i X (i,j),k > (68) 

(i,j)eTxT c 

where 

(a) follows from the facts that , ^(tv 2 )) I s an -^ 2 partition of <5 and each contains 

exactly one edge. 

(b) follows from the fact that for each (i. j) G T c x T c , 


hpx?, Y ? {Y(ZxT'>)n{(i,j)},k\ x (T<>xZ)n{(i,j)},k) ~ h Px n tY? (' Y(ixT c )n{(i,j)},k\ x (i,j),k )■ 


Let ,) *. be the distribution of the zero-mean Gaussian random variable Zu^ k whose variance is 
a 1 ^ ■. for each (i,j) G £ and each k G {1,2,..., n}. It then follows from ([54]) and ( [48] ) that Y^^ k has 
the same distribution as Xu^ k + Z^j^ k when Yuj^ k and (Xuj) tk , Z^j^ k ) are distributed according 
to px\t,Ye an( l Px (i j) k Pz (i j, k respectively for each (i. j) G £ and each k G {1.2..... n}, which implies 
that 


1 n 

~ X X {hpxg.Yg i x (i,j),k) ~ h Px „ Y g (Y^j) k \X^j^ k )) 


k=l («J)sTxT' 

1 

n 


X X ^ l P x (i,j),kP z (i,j),S X {kj),k + Z(i,j),k) hp X{ . ^ th Pz, iJ)th (X(i,j),k + 

fc=l (i,j)eTxT° 

(69) 


Following ([69]), we consider 


P x (i,j),kP z (i,j),k 


( x (i,j),k + %j),fe) < log ^2ite (Vax p . Y(ij)ifc [% i))fc ] + Var PZ( . [%j),fc]) 


< log W27re (E Px 


‘(i.j).fc 


= log t/27re ( E px 




x r i 


X?. , 


+ Var 


(i,j),fc 


(J 


(*d) 


(70) 
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and 


^ l P x u,j),kP 2: <i,j), k (X(i,j),k + hp z (-^(i,j),fc) 


= log ^/27reVar pZ( . , ) fc [Z {i>j)ik ] 


= log^27re C 72. i) 


(71) 


for each (?', j) E £ and /;• E {1.2..... n}, where (a) follows from the fact that the differential entropy of 
a random variable X is maximized by that of the zero-mean Gaussian random variable whose variance 
is Var[X], Combining ( [67] ), ( [68] ), ( [69] ), ( |70| ) and ( fTT] ), we have 


1 1 ] 

E 5 lo g | 1 + 

ieT n n k= 1 (■ i,j)eTxT° 


E, 


P*i 




X 2 . 


cr 


(*d) 


(a) 

< 


[i E »r.,„ 

I v 2 

n <Efc=l ^{i,j),k 


(J?. ., 

(*j) 


(72) 


(i,j)eTxT c 

where (a) follows from Jensen’s inequality. Choose {ne} < fL 1 to be a subsequence of {n}^ =1 such that 
for all (i,j) E £, lim E p U EfcLi Xl J converges to some S ( * E R, i.e., 

£—>■00 (i,j) \ ij/i j \ u j 


S* {i = lim E p 

V l ’ 3 ) £—roo 


Tin 


ni 


E x h 


i),k 


k =1 


(73) 


Combining ([65]), ( 661 , (72 1 and (731, we obtain 


E>< x iiogfi + s <-» 


(77 


(74) 


ieT (ij)eTxT c \ (*d) / 

Define = (S^* ^ ■ (i, j) E 5) to be an N 2 dimensional-tuple. Since does not depend on T by ( |73| ), 
it follows from ( f74| ) that 


Ki£ P| 

TCl-.T^nV^ 


t>N 


T, R i< E 5 log ( 1 + 


i£T 


(i,j)eTxT<= 

On the other hand, we conclude from (731, (451, (46) and ( [47] ) that 


t 0.j) 


(75) 


S h) * 


for each (i.j) E £, 


N 


E s h) s « 

J=i 


(76) 


(77) 
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for each i El, and 


N N 

EE %>£ p - 

*= 1 j =1 


( 78 ) 


Combining ([56]), ( [76] ), ( [77] ) and (78), we have 


S* £ eS(P £ ,P x ,P). 


(79) 


Consequently, ( |64| ) follows from (75) and (79). Since (64) holds for all channel operation sequence n, 
([63]) also holds. 


VII. Concluding Remarks 

We investigate the MMN consisting of independent channels and propose an edge-delay model which 
allows the presence of zero-delay edges in the network. Under our model, the MMN with independent 
channels is characterized by multiple channels, which are operated in different orders so that an edge 
may incur zero delay on some other edges. Our model is a generalization of the classical model, under 
which the MMN with independent channels is characterized by a single channel and every edge incurs a 
unit delay on every adjacent edge. A well-known outer bound on the capacity region under the classical 
unit-delay assumption is the cut-set outer bound. In this paper, we prove that the MMN with independent 
channels and zero-delay edges lies within the classical cut-set bound despite a violation of the classical 
unit-delay assumption. 

Next, we use our outer bound to prove the capacity region of the MMN consisting of independent 
DMCs with zero-delay edges. More specifically, we show that the capacity region is the same as the set 
of achievable rate tuples under the classical unit-delay assumption. This capacity region result is then 
generalized to the MMN consisting of independent AWGN channels with zero-delay edges. Consequently, 
the capacity regions of the two aforementioned MMNs are not affected by the handling of delays among 
edges even when zero-delay edges are present. 


It has been shown in Section I-A that the capacity region of some two-node MMN with dependent 
channels and zero-delay edges is strictly larger than the set of achievable rate tuples under the classical 
unit-delay assumption — every edge should incur a unit delay on all the edges. Future research may 
investigate the capacity regions for general MMNs consisting of dependent channels with zero-delay 
edges. Another interesting direction for future research is extending the network equivalence theory |6] 
(which asserts the equivalence between the capacity region of any network with independent channels 
and the capacity region of the deterministic counterpart of the network) for networks with unit-delay 
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edges to networks with zero-delay edges. 
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